Automatic detection of musicians' ancillary gestures based on video analysis
نویسندگان
چکیده
A novel approach for the detection of ancillary gestures produced by clarinetists during musical performances is presented in this paper. Ancillary gestures, also known as non-obvious or accompanist gestures are produced spontaneously by musicians during their performances and do not have meaning in sound, but they help in the creation of music. The proposed approach consists in detecting, segmenting and tracking points of interest and parts of the musician body in video scenes to further analyze if the movement associated to these points of interest or body parts could be related to ancillary gestures. In particular, we tackle the problem of detecting the three most commonly seen ancillary gestures of this class of musicians: clarinet bell moving up and down, bending of the knees and shoulder curvature. In this paper we show that the optical flux algorithm for tracking a point of interest at the bottom of the clarinet bell and the projection profile algorithm for analyzing the knees and the shoulder regions are effective in detecting ancillary movements related to the clarinet, knee movement and body curvature respectively. These techniques were evaluated with respect to the precision and recall in detecting ancillary gestures on 12,423 video frames of nine clarinetists’ presentations recorded in a studio. The experimental results have shown that the precision in detecting ancillary gestures varies between 78.4% and 92.8%, while the recall varies between 85.3% and 95.5%. These results also imply that any further analysis of the videos by specialists could focus on less than 500 frames which represents a reduction of more than 99% in the
منابع مشابه
Neural Network Performance Analysis for Real Time Hand Gesture Tracking Based on Hu Moment and Hybrid Features
This paper presents a comparison study between the multilayer perceptron (MLP) and radial basis function (RBF) neural networks with supervised learning and back propagation algorithm to track hand gestures. Both networks have two output classes which are hand and face. Skin is detected by a regional based algorithm in the image, and then networks are applied on video sequences frame by frame in...
متن کاملValidating kinematic displays for the perception of musical performance
Human gestures contain certain characteristics and meanings in communication and represent a link between intention and body. This paper describes a pilot study investigating the role of ancillary musical gestures in understanding musical meaning from the listener’s standpoint. We conducted a perceptual experiment using motion-capture recordings of musicians. Participants were presented video r...
متن کاملFacial Expression Recognition Based on Anatomical Structure of Human Face
Automatic analysis of human facial expressions is one of the challenging problems in machine vision systems. It has many applications in human-computer interactions such as, social signal processing, social robots, deceit detection, interactive video and behavior monitoring. In this paper, we develop a new method for automatic facial expression recognition based on facial muscle anatomy and hum...
متن کاملCompressed Domain Scene Change Detection Based on Transform Units Distribution in High Efficiency Video Coding Standard
Scene change detection plays an important role in a number of video applications, including video indexing, searching, browsing, semantic features extraction, and, in general, pre-processing and post-processing operations. Several scene change detection methods have been proposed in different coding standards. Most of them use fixed thresholds for the similarity metrics to determine if there wa...
متن کاملSummarization of Video - taped Presentations : Automatic Analysis of Motion
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. XX, NO. Y, SEPTEMBER 1998 Summarization of Video-taped Presentations: Automatic Analysis of Motion and Gesture Shanon X. Ju, Michael J. Black, Scott Minneman, Don Kimber Abstract| This paper presents an automatic system for analyzing and annotating video sequences of technical talks. Our method uses a robust motion estimation ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Expert Syst. Appl.
دوره 41 شماره
صفحات -
تاریخ انتشار 2014